Skip to main content

All Questions

0votes
0answers
60views

Understanding the concepts of embedding in Roberta architecture?

I'm reading the implementation file of Roberta architecture, specifically in the RobertaEmbedding class, this class has the comment: ...
user avatar
1vote
0answers
827views

Finetuning Mistral or MistralForSequenceClassification for text classification

I need to do text classification and have a dataset of 10K entries. I am considering using mistral and following a tutorial like https://huggingface.co/docs/transformers/training and replace model ...
Karl 17302's user avatar
0votes
1answer
193views

Setting number of rows returned by vector stores

When using vector stores like pinecone or Faiss from langchain, is it possible to set the number of records returned based on similarity search? For example, consider the following code, is there a ...
Karl 17302's user avatar
1vote
0answers
57views

Can I reduce computation by only predicting response tokens in a transformer and still get the same gradients?

I have been looking at the source code of the Stanford Alpaca model and I believe that during inference, the whole instruction + response data is fed into the model normally. Then the instruction part ...
Tianchen Zheng's user avatar
0votes
1answer
2kviews

Compare Strings composed from 2-3 words using NLP/ML(Python)

I have a database of books. Each book have a list of categories that describe the genre/topics of the book (I use Python models). Most of the time, the categories in the list are composed from 1-3 ...
Eitan Rosati's user avatar
2votes
1answer
126views

How to generate a response while considering past questions as well?

User: What is the tallest mountain? Agent: Everest User: Where is it located? # Agent hears: "Where is Everest located?" Agent: Nepal I want to be able ...
angryweasel's user avatar
0votes
1answer
176views

Identify whether two companies are the same

I am trying to solve a problem where I need to map multiple variations of a company name to a single name. For example: say I have a company named ...
Clock Slave's user avatar
0votes
1answer
53views

How to train a sequence labeling model with annotations from three annotators?

I have a dataset of movie reviews annotated by 3 persons. The following example contains one sentence with corresponding annotations from 3 different persons. ...
Ahmad's user avatar
1vote
0answers
72views

How to scrape product data on supplier websites?

I'm currently trying to build a semantic scraper that can extract product information from different company websites of suppliers in the packaging industry (with as little manual customization per ...
johannesha's user avatar
1vote
0answers
81views

Determining if an entity in free text is 'present' or 'absent'; what is this called in NLP?

I'm processing a semi-structured scientific document and trying to extract some specific concepts. I've actually made quite good progress without machine-learning so far, but I got to a block of true ...
vector07's user avatar
1vote
1answer
102views

How to implement or avoid masking for transformer?

When it comes to using Transformers for image captioning is there any reason to use masking? I currently have a resnet101 encoder and am trying to use the features as the input for a transformer model ...
Gibbo0789's user avatar
1vote
0answers
33views

What are the best techniques to perform text simplification?

I'm evaluating the state of the art techniques to translate legal text to simple text, what are the best approaches for a non-English language (Portuguese)?
Miguel Carvalhais Matos's user avatar
0votes
1answer
155views

Get Neural Network to predict a tag/class on a certain word using the surrounding words as context [PyTorch]?

I am somewhat a novice at the topic of Neural Netoworks and PyTorch. I am trying to create a model that takes a word (that I have modified very slightly) and a 'window' of context around it and ...
Justin Cunningham's user avatar
2votes
1answer
125views

How to predict the "word" based on the meaning in a document?

What I mean to say is For example, if I give the meaning of Apple from the dictionary as input to the program, it should give output as Apple. Or I say My day to day job involves monitoring and ...
Sid's user avatar
  • 121
0votes
1answer
4kviews

How to make spacy lemmatization process fast? [closed]

I am applying spacy lemmatization on my dataset, but already 20-30 mins passed and the code is still running. Is there anyway to make it faster? Is there any option to do this process using GPU? My ...
Cathrine's user avatar

153050per page
close